Methods of parallel text data clustering algorithm implementation
نویسندگان
چکیده
منابع مشابه
Parallel Implementation of Genetic Algorithm using K-Means Clustering
-----------------------------------------------------------------ABSTRACT-------------------------------------------------------The existing clustering algorithm has a sequential execution of the data. The speed of the execution is very less and more time is taken for the execution of a single data. A new algorithm Parallel Implementation of Genetic Algorithm using KMeans Clustering (PIGAKM) is...
متن کاملUse of the Improved Frog-Leaping Algorithm in Data Clustering
Clustering is one of the known techniques in the field of data mining where data with similar properties is within the set of categories. K-means algorithm is one the simplest clustering algorithms which have disadvantages sensitive to initial values of the clusters and converging to the local optimum. In recent years, several algorithms are provided based on evolutionary algorithms for cluster...
متن کاملA Fuzzy C-means Algorithm for Clustering Fuzzy Data and Its Application in Clustering Incomplete Data
The fuzzy c-means clustering algorithm is a useful tool for clustering; but it is convenient only for crisp complete data. In this article, an enhancement of the algorithm is proposed which is suitable for clustering trapezoidal fuzzy data. A linear ranking function is used to define a distance for trapezoidal fuzzy data. Then, as an application, a method based on the proposed algorithm is pres...
متن کاملEvaluating Text Clustering Methods for Text Classification
In this project report, I will evaluate the several text clustering approaches and how they can be used for the purpose of text classification. The particular task is topic classification of 20 Newsgroup dataset and sentiment classification restaurant reviews dataset. Future direction for improving the results will also be discussed.
متن کاملApplication of Parallel Annealing Particle Clustering Algorithm in Data Mining
With development of the computer technology, the large-scale calculation problems are often appeared in the network, it needs a lot of system resources and support of hardware, it often bring troubles in engineering optimization, so it needs apply the method such as the group's global optimization method and its improved algorithm to obtain reliable results in the computer system. In the study,...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Technology audit and production reserves
سال: 2015
ISSN: 2312-8372,2226-3780
DOI: 10.15587/2312-8372.2015.37422